sampling distribution

Definition and Purpose

A sampling distribution is the distribution of statistics from many samples (all with the same sample size)

Understanding Dots in Sampling Distributions

Each dot in a sampling distribution for the mean represents a single sample mean (x¯) calculated from one sample of size n. The position of the dot on the x-axis shows the value of that sample mean.

How Dots Are Created

  1. Take a random sample of size n from the population
  2. Calculate the mean (x¯) for this sample
  3. Plot this mean as one dot
  4. Repeat many times with new samples of the same size

What the Pattern Tells Us

The overall pattern of dots reveals important information about the sampling distribution:

Example with Student Heights

Imagine measuring student heights:

Key Concepts

Concept Description
Purpose Shows variability of statistics across samples due to random chance
Practice Usually work with one sample, but concept helps understand variability
Application Used in inference and hypothesis testing

Types of Distributions

Type Symbol Description Example
Population Distribution μ, σ Distribution of all values in population All student heights
Sample Distribution x¯, s Distribution of values in one sample Heights of 30 students
Sampling Distribution μx¯, SE Distribution of statistics from many samples Means of many samples

Construction Process

Step Action Notes
1. Population Start with population Often unknown in practice
2. Sampling Take sample of size n Must be random
3. Calculate Compute desired statistic Mean, proportion, etc.
4. Repeat Many times with same n Typically 1000+ times
5. Plot Create distribution of statistics Usually approximately normal

Properties

Center and Shape

Property Description Mathematical Form
Center Equal to population parameter E(x¯)=μ
Shape Approaches normal distribution Via Central Limit Theorem
Bias Unbiased if center = parameter Bias=E(statistic)parameter

Variability Measures

Measure Symbol Formula Description
Population SD σ - Measures population variability
Sample SD s (xx¯)2n1 Estimates population variability
Standard Error SE σn Measures sampling variability

Central Limit Theorem (CLT)

Key Points

Aspect Description
Distribution Shape Approaches normal as n increases
Requirements Independent random samples
Sample Size Usually n30 is sufficient
Application Works regardless of population shape

Effect of Sample Size

Change in n Effect on Sampling Distribution
Increases More normal shape
Increases Smaller SE (1n)
Increases Better estimate of parameter

Example: Movie Budgets (2012-2018)

Population Parameters

Parameter Value
Population Size 1,056 movies
Population Mean (μ) $51.38 million
Time Period 2012-2018

Sampling Distribution Characteristics

Sample Size Characteristics
n=10 High variability, less normal
n=1000 Low variability, more normal

Mathematical Relationship

For means:

SEx¯=σn

For proportions:

SEp^=p(1p)n

These concepts are fundamental for: